cient Computations on fault - prone BSP machines

نویسندگان

  • Spyros C. Kontogiannis
  • Grammati E. Pantziou
  • Paul G. Spirakis
چکیده

In this paper general simulations of algorithms designed for fully operational BSP machines on BSP machines with faulty processors or unavailable processors are developed. The fail-stop model is considered, that is, if a processor fails or becomes unavailable it remains so until the end of the computation. The faults are random, that is, a processor may fail independently with probablility a, a is a constant. Two possible settings for fault occurence are considered: the faults are either static (the faulty or unavailable processors are already known at the start of the computation) or dynamic (the processors become faulty or unavailable during the computation). In the case of static faults, a simulation of an n-processor fault-free BSP machine on a faulty n-processor BSP machine is presented with constant slowdown per local computation step and O(log n maxfL; gg) slowdown per communication step, given that a preprocessing has been done that needs O(log 2 n maxfL; gg) time. L and g are the parameters of the simulating BSP machine. In the case of dynamic faults, a simulation of an n-processor fault-free BSP machine on an cn logn-processor faulty BSP machine is presented. No dynamic faults may occur during certain periods of the simulation. The simulations are randomized and Monte Carlo: they are guaranteed to be correct with high probability, and the time bounds always hold. To our knowledge, no previous work on the fault tolerance of the BSP model exists.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Xxx Can We Use a Different Title That Formats Better? E.g. Adaptive Bsp on a Network of Workstaions?

Ÿ¡ S¢s£'¤ ¥S¦§£z ̈ XXX CAN WE USE A DIFFERENT TITLE THAT FORMATS BETTER? E.G. ADAPTIVE BSP ON A NETWORK OF WORKSTAIONS? XXX Several computing environments, including wide area networks and non-dedicated networks of workstations, are characterized by frequent unavailability of the participating machines. Parallel computations, with interdependencies among their component processes, cannot make pr...

متن کامل

Runtime Support for Virtual BSP Computer

Several computing environments including wide area networks and nondedicated networks of workstations are characterized by frequent unavailability of the participating machines. Parallel computations, with interdependencies among their component processes, can not make progress if some of the participating machines become unavailable during the computation. As a result, to deliver acceptable pe...

متن کامل

Pipelined Decomposable BSP Computers

The class of weak parallel machines is interesting, because it contains some realistic parallel machine models, especially suitable for pipelined computations. We prove that a modification of the bulk synchronous parallel (BSP) machine model, called decomposable BSP (dBSP), belongs to the class of weak parallel machines if restricted properly. We will also correct some earlier results about pip...

متن کامل

Fully-Scalable Fault-Tolerant Simulations for BSP and CGM

In this paper we consider general simulations of algorithms designed for fully operational BSP and CGM machines on machines with faulty processors. The faults are deterministic (i.e., worst-case distributions of faults are considered) and static (i.e., they do not change in the course of computation). We assume that a constant fraction of processors are faulty. We present a deterministic simula...

متن کامل

1 Adaptive Bulk - Synchronousparallelism in a Network Ofnondedicated

Several computing environments including wide area networks and nondedicated networks of workstations are characterized by frequent unavail-ability of the participating machines. Parallel computations, with interdepen-dencies among their component processes, can not make progress if some of the participating machines become unavailable during the computation. As a result , to deliver acceptable...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996